a new job scheduling in data grid environment based on data and computational resource availability

نویسندگان

najme mansouri

چکیده

data grid is an infrastructure that controls huge amount of data files, and provides intensive computational resources across geographically distributed collaboration. the heterogeneity and geographic dispersion of grid resources and applications place some complex problems such as job scheduling. most existing scheduling algorithms in grids only focus on one kind of grid jobs which can be data-intensive or computation-intensive. however, only considering one kind of jobs in scheduling does not result in suitable scheduling in the viewpoint of all systems, and sometimes causes wasting of resources on the other side. to address the challenge of simultaneously considering both kinds of jobs, a new integrated job scheduling strategy (ijss) is proposed in this paper. at one hand, the ijss algorithm considers both data and computational resource availability of the network, and on the other hand, considering the corresponding requirements of each job, it determines a value called w to the job. using the w value, the importance of two aspects (being data or computation intensive) for each job is determined, and then the job is assigned to the available resources. the simulation results with optorsim show that ijss outperforms comparing to the existing algorithms mentioned in literature as number of jobs increases.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A New Job Scheduling in Data Grid Environment Based on Data and Computational Resource Availability

Data Grid is an infrastructure that controls huge amount of data files, and provides intensive computational resources across geographically distributed collaboration. The heterogeneity and geographic dispersion of grid resources and applications place some complex problems such as job scheduling. Most existing scheduling algorithms in Grids only focus on one kind of Grid jobs which can be data...

متن کامل

A New Job Scheduling in Data Grid Environment Based on Data and Computational Resource Availability

Data Grid is an infrastructure that controls huge amount of data files, and provides intensive computational resources across geographically distributed collaboration. The heterogeneity and geographic dispersion of grid resources and applications place some complex problems such as job scheduling. Most existing scheduling algorithms in Grids only focus on one kind of Grid jobs which can be data...

متن کامل

Reliability and Availability Improvement in Economic Data Grid Environment Based On Clustering Approach

Abstract - One of the important problems in grid environments is data replication in grid sites. Reliability and availability of data replication in some cases is considered low. To separate sites with high reliability and high availability of sites with low availability and low reliability, clustering can be used. In this study, the data grid dynamically evaluate and predict the condition of t...

متن کامل

An Efficient Data Replication Strategy in Large-Scale Data Grid Environments Based on Availability and Popularity

The data grid technology, which uses the scale of the Internet to solve storage limitation for the huge amount of data, has become one of the hot research topics. Recently, data replication strategies have been widely employed in distributed environment to copy frequently accessed data in suitable sites. The primary purposes are shortening distance of file transmission and achieving files from ...

متن کامل

reliability and availability improvement in economic data grid environment based on clustering approach

abstract - one of the important problems in grid environments is data replication in grid sites. reliability and availability of data replication in some cases is considered low. to separate sites with high reliability and high availability of sites with low availability and low reliability, clustering can be used. in this study, the data grid dynamically evaluate and predict the condition of t...

متن کامل

Data Replication-Based Scheduling in Cloud Computing Environment

Abstract— High-performance computing and vast storage are two key factors required for executing data-intensive applications. In comparison with traditional distributed systems like data grid, cloud computing provides these factors in a more affordable, scalable and elastic platform. Furthermore, accessing data files is critical for performing such applications. Sometimes accessing data becomes...

متن کامل

منابع من

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید


عنوان ژورنال:
amirkabir international journal of modeling, identification, simulation & control

ناشر: amirkabir university of technology

ISSN 2008-6067

دوره 47

شماره 1 2015

میزبانی شده توسط پلتفرم ابری doprax.com

copyright © 2015-2023